Evaluating a Bilingual Test: Adding the Consumer's Point of View



Since the spring of 1979, the Honolulu District in Hawaii has been
using the Basic Inventory of Natural Language (BINL) to measure English
language proficiency of students participating in the Students of Limited
English Proficiency (SLEP) Program. The test was selected by the
district as an instrument for screening, diagnosis, placement, and
evaluation. The purpose of the present study was to review and evaluate
the BINL to help determine whether the continued use of the test in the
district is warranted.

The SLEP Program is designed to serve students whose dominant
language is not English and whose limitation in the use of English
prevents them from functioning effectively in the regular classroom. The
overall objective of the program is to help these students to adjust to
the American culture in the Hawaiian setting by acquiring basic
communication skills to participate in the regular classroom instruction
and school activities appropriate for their age and grade level.

Students are selected to participate in the program on the basis of
their language dominance ratings as determined by criteria specified in
the Identification Assessment Programming System (Hawaii Department of
Education, 1980). Only students who receive language dominance ratings
of 1 and 2 are eligible to participate in the program. Participants are
exited from the program when they reach a language dominance rating of 3
(or above) and score at the 25th percentile (or above) on the Metropolitan
Achievement Test in reading, language arts, and mathematics.
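
The entry and exit rules described above can be sketched as a small
decision function. This is only an illustration: the record layout and
function names are hypothetical, and the sketch assumes that the 25th
percentile cutoff applies to each of the three MAT subtests separately,
which the text does not state explicitly.

```python
from dataclasses import dataclass

# Thresholds taken from the program description above.
EXIT_MIN_DOMINANCE_RATING = 3
EXIT_MIN_MAT_PERCENTILE = 25
MAT_SUBTESTS = ("reading", "language_arts", "mathematics")


@dataclass
class StudentRecord:
    """Hypothetical record layout, used only for this illustration."""
    dominance_rating: int
    mat_percentiles: dict  # subtest name -> percentile score


def is_eligible_for_program(student):
    """Only language dominance ratings of 1 and 2 qualify for entry."""
    return student.dominance_rating in (1, 2)


def meets_exit_criteria(student):
    """Rating of 3 or above AND every MAT subtest at or above the cutoff.

    Assumption: the percentile cutoff is applied subtest by subtest;
    a missing subtest score is treated as not meeting the cutoff.
    """
    if student.dominance_rating < EXIT_MIN_DOMINANCE_RATING:
        return False
    return all(
        student.mat_percentiles.get(subtest, 0) >= EXIT_MIN_MAT_PERCENTILE
        for subtest in MAT_SUBTESTS
    )
```

Under this reading, a student with a rating of 3 and a reading score at
the 20th percentile would remain in the program.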



Currently, the SLEP Program is offered in 55 schools in the district,
serving some 5,000 students. Program staff include 7 permanent teachers,
51 temporary teachers, 8 school home aides, 15 educational assistants,
and 47 part-time temporary teachers. Program funding is approximately
$1.3 million per year.

The BINL purports to measure language proficiency in terms of the
complexity of the language used by students in grades K-12. The test
items consist of 40 large story starter pictures. The student selects 3
to 5 pictures and responds to the items by making up stories about the
pictures or about some of the people and things in the pictures. The
pictures may be used as a jumping-off point for the student to speak
about a personal experience. Student responses are tape recorded to
facilitate scoring.

The BINL scores are based on 10 speech samples taken from the student
during the test. The test provides a series of scores which may be used
to build a language profile for each student. Fluency is indicated by
the total number of words used by the student during language sampling.
This score is said to be a general indicator of ability to use the
vocabulary, structures and forms of a language. Level of complexity is a
measure of the student's command of the structures of the language,
including the use of modifiers, phrases and clauses. Average sentence
length is a measure of the fluency count and the number of phrases or
sentences used by the student.
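
As a rough illustration of the two simpler scores, the sketch below counts
words across transcribed speech samples. It assumes that average sentence
length is the fluency count divided by the number of samples, with each
sample treated as one sentence; the BINL scoring rules themselves are not
reproduced here, and the function names are hypothetical.

```python
def fluency(samples):
    """Total number of words across all transcribed speech samples."""
    return sum(len(sample.split()) for sample in samples)


def average_sentence_length(samples):
    """Words per sample, treating each sample as one sentence (assumption)."""
    if not samples:
        return 0.0
    return fluency(samples) / len(samples)
```

For two samples of three and six words, the fluency count would be 9 and
the average sentence length 4.5 words.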
The BINL complexity level scale ranges from 0 to 200. Average
sentence length ranges from 0 to 15 words. Based on score ranges,
language proficiency categories are established. These include
non-English speaking, limited English speaking, fluent English speaking,
and proficient English speaking. Separate score ranges are used to
establish the categories for different grade level groupings (K-2,
3-6, 7-8, 9-12). Students in the early grades are generally not expected
to reach high school levels of complexity.

The BINL is administered individually. Test administration generally
takes 10-15 minutes. The test is both hand- and machine-scorable.



PROCEDURES

The study looked at four critical aspects of the BINL. First, the
validity of the test, primarily its content validity, was examined.
Second, the test-retest reliability of the BINL was assessed. Third, the
effects of the BINL as a measure for exiting project students were studied
in terms of the students' post-SLEP performance in the regular
classroom. Fourth, a test use survey was conducted to find out how
satisfied the project staff were with the use of the BINL. Specific
procedures used in the study are described as follows.

A. Test Review Committee

The formation of a test review committee was a critical step in the
study. The committee was charged with assessing the content validity and
other psychometric qualities of the BINL on the basis of pre-specified
criteria. Committee members were selected on the basis of:

- background in bilingual education,
- knowledge of and experience with test and measurement in general,
- knowledge of and experience with assessment involving bilingual
  students,
- knowledge of and experience [...],
- knowledge and understanding of the approach to compensatory
  education in the Honolulu district,
- [...] of SLEP instructional objectives, and
- self-assurance and capacity for critical thinking.

Based on the above criteria, nine individuals were selected as members
of the test review committee. They included four district staff, a
school-level project teacher, a state-level evaluation staff, two
university faculty members and the external evaluator.

B. Test Evaluation Criteria



Numerous sources were used to develop a set of criteria for test
evaluation. These included various documents produced by the Center for
the Study of Evaluation of UCLA (Hoepfner, et al., 1976), the Center for
Bilingual Education (Silverman, et al., 1975; Silverman, et al., 1978)
and the Assessment Projects at the Northwest Regional Educational
Laboratory (Nafziger, et al., 1975), the American Psychological
Association, the American Educational Research Association, the National
Council of Measurement in Education (Davis, et al., 1974), as well as
individual researchers (e.g., Madaus, et al., 1982). The final set of
criteria used in the present study thus represents a comprehensive
compilation of generally accepted test standards which had been field
tested and used in test evaluation.

More specifically, the criteria relate to four major areas of test
characteristics: measurement validity, examinee appropriateness,
technical excellence, and administrative usability. The criterial areas
are further described as follows:

Measurement validity. This set of criteria looks at the nature of
what a test measures, the range of behaviors sampled, the relationship of
the test score to other measures, and the demonstrated usefulness of the
test in theoretical or practical settings.

Examinee appropriateness. These criteria relate to the
appropriateness of the test materials, including content of the stimuli
(items) and mode of response, relative to the grade level of students
taking the test.

Administrative usability. These criteria deal with practical
concerns in administering and using a test: the ease with which the test
can be given, scored, and interpreted, and the usefulness of the
resulting score in making program or instructional decisions.

Technical excellence. These criteria are concerned with the test's
reliability, replicability and refinement of measurement.

Each of the four criterial areas included several individual
criteria. To facilitate test review, these individual criteria were
transformed into questions to guide the test reviewers in test evaluation.

C. Test Review

The review session was preceded by a test administration
demonstration provided by one of the district staff. The demonstration
was provided to enhance committee members' understanding of the BINL.
Strict protocols were observed during this review session. Committee
members followed directions developed specifically for the test review.
Criteria to be used to evaluate the BINL were explained to committee
members by the external evaluator. All evaluative decisions were based
on information presented in the manual and related materials supplied by
the test developer. No attempt was made to verify the available
information. On the other hand, when needed information was not
available and was not readily inferrable from existing data, an
unfavorable rating was to be given.

In assessing content validity of the BINL, the 40 starter pictures
served as test items in the review process. The items were assessed in
terms of the degree of match with a set of instructional objectives
provided by the SLEP program staff. These objectives relate to students'
ability to:

- express feelings, attitudes, and meaning through a variety of
  sentence structures;
- make statements;
- ask questions;
- interact with others to convey a message; and
- express ideas effectively and clearly.

For each BINL item, test reviewers were asked to determine whether
the item provided a relevant measure of at least one of the SLEP
objectives. Reviewers were also asked to determine the percentage of
SLEP objectives measured by one or more of the BINL items.
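
The two content validity questions amount to two coverage proportions
computed over the same item-by-objective judgments. The sketch below shows
one way to compute them; the data structure, in which each item is mapped
to the set of objectives a reviewer judged it to measure, is an assumption
made for illustration and is not the form used by the committee.

```python
def item_coverage(item_to_objectives):
    """Proportion of items judged to measure at least one objective."""
    if not item_to_objectives:
        raise ValueError("no items were rated")
    matched = sum(1 for objs in item_to_objectives.values() if objs)
    return matched / len(item_to_objectives)


def objective_coverage(item_to_objectives, objectives):
    """Proportion of objectives measured by one or more items."""
    if not objectives:
        raise ValueError("no objectives were supplied")
    covered = set()
    for objs in item_to_objectives.values():
        covered.update(objs)
    # Ignore any judged objective that is not in the official list.
    return len(covered & set(objectives)) / len(objectives)
```

With the five SLEP objectives numbered 1 to 5 and every starter picture
mapped to objectives 1, 2 and 5 only, item coverage would be 1.0 while
objective coverage would be 0.6.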

D. Test-Retest Reliability

To assess test reliability of the BINL, a test-retest study was
performed on the test as it is used in the SLEP program. A random sample
of 192 students was drawn from different grade levels at different
schools. Data elements included school name, grade level of student,
student name, BINL raw score, NCE score, and BINL level. Data coding was
performed by the district staff. Completed data sheets were mailed to the
author for key-punching and analysis.
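
A test-retest coefficient of this kind is conventionally the Pearson
product-moment correlation between the first and second scores of the same
students. The sketch below assumes that statistic, since the text does not
name the formula used, and operates on paired score lists supplied by the
caller.

```python
import math


def pearson_r(first, second):
    """Pearson correlation between paired first-test and retest scores."""
    n = len(first)
    if n != len(second) or n < 2:
        raise ValueError("need two equally long lists of at least 2 scores")
    mean_x = sum(first) / n
    mean_y = sum(second) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(first, second))
    var_x = sum((x - mean_x) ** 2 for x in first)
    var_y = sum((y - mean_y) ** 2 for y in second)
    if var_x == 0 or var_y == 0:
        raise ValueError("scores must vary on both occasions")
    return cov / math.sqrt(var_x * var_y)
```

Because the correlation is unaffected by adding a constant to every retest
score, a uniform gain between testings does not by itself lower the
coefficient.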



E. Mainstreaming

A mainstreaming study was performed to assess the effects of the
BINL as a measure for exiting students. Data were gathered for a random
sample of over 200 students at different grades in different schools.
Only students who had exited the SLEP Program for at least six months were
included in the sample. The data included school name, student name,
grade level, BINL administration date, BINL raw score, MAT administration
date, MAT percentile scores (language arts, reading and mathematics), and
year-end school grades. For purposes of comparability, all school grades
were converted to scores on a five-point scale (i.e., 1 = failing, 2 =
barely passing, 3 = satisfactory, 4 = very good, 5 = excellent). Data
coding was performed by the district staff. Completed data sheets were
mailed to the author for key-punching and analysis.
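
The conversion to a common five-point scale can be pictured as a lookup
table per grading scheme. The letter-grade mapping below is purely
hypothetical, since the schemes actually used by the schools are not
described; only the five scale labels come from the study.

```python
# Scale labels as defined in the study.
FIVE_POINT_LABELS = {
    1: "failing",
    2: "barely passing",
    3: "satisfactory",
    4: "very good",
    5: "excellent",
}

# Hypothetical school scheme: conventional letter grades.
LETTER_TO_FIVE_POINT = {"F": 1, "D": 2, "C": 3, "B": 4, "A": 5}


def convert_letter_grade(grade):
    """Map a letter grade onto the common five-point scale."""
    key = grade.strip().upper()
    if key not in LETTER_TO_FIVE_POINT:
        raise ValueError(f"unrecognized grade: {grade!r}")
    return LETTER_TO_FIVE_POINT[key]
```

A school using a different scheme would need its own table, but the
converted scores would then be comparable across schools.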



F. Test Use Survey

Three separate surveys were conducted in the Honolulu, Leeward and
Central districts in Hawaii to obtain a measure of test use satisfaction
on the BINL and the Language Assessment Scales (LAS). The survey
instrument consisted of 41 items developed essentially on the basis of
the same set of criteria used in the test review study. Items specific
to information contained in the BINL or LAS manual were [...], however.
See Appendix A.



All three surveys were conducted in April 1983. Data obtained from
the surveys were coded and entered into the computer by district staff
for preliminary tabulation. Analyses were performed separately for each
district and an additional analysis was conducted by pooling data from
the Leeward and Central districts.



RESULTS 



This section presents the results of each of the evaluative
activities performed in the study. First, the findings of the test
review committee are discussed. This is followed by results obtained in
the reliability study, the mainstreaming study, and the test use
satisfaction surveys.



A. Test Review Findings

These findings are discussed first in terms of the general qualities
of the BINL as reviewed on the basis of the pre-specified criteria and
then specifically in terms of its content validity.

General qualities. Responses suggest that the test reviewers
generally perceived the BINL to be an instrument of high merit. With few
exceptions, the test received favorable ratings from the committee
members. Several items are worthy of particular mention.

First, the BINL item development process appeared to be of
concern to the committee members. While some of the rationale could be
inferred from the test manual, the item development process
appeared poorly documented.

Secondly, as to whether the BINL could be expected to correlate with
student performance in school subjects, reviewers indicated that the BINL
measured oral language proficiency and could not be expected to serve as
a predictor of achievement in other academic areas.

Thirdly, some reviewers were not certain whether the scoring
procedure can be described as objective and simple. The majority of
reviewers, however, thought it was.



Fourthly, while virtually all reviewers felt the norm groups used in
the standardization of the BINL were of sufficient size, a few expressed
reservations over the representativeness of the norm groups
with respect to racial, ethnic, economic and sexual representation.

Fifthly, a couple of reviewers expressed concern over whether the
BINL was capable of adequately differentiating among students at the
upper and lower ends of the achievement spectrum and
measured a diversity of skills of bilingual students. A majority of the
reviewers, however, did not see the BINL lacking such capabilities.

As mentioned earlier, responses to most of the items were favorable.
The overall positive perceptions of the test review committee were
best reflected in their recommendation of the use of the BINL in the SLEP
Program.

Content validity. Content validity was assessed by posing two
questions to the test reviewers: (a) What proportion of the BINL items
appears to measure one or more of the SLEP instructional objectives as
identified by the program staff? (b) What proportion of the SLEP
instructional objectives is measured by one or more of the BINL items?

As mentioned earlier, for purposes of this study, the 40 starter
pictures, being the primary stimuli for eliciting student responses, were
regarded as test items. A set of five major instructional objectives was
identified by the SLEP program staff. These objectives pertained to the
student's ability to:

1. express feelings, attitudes, and meaning through a variety of
   sentence structures;
2. make statements;
3. ask questions;
4. interact with others to convey a message; and
5. express ideas effectively and clearly.

An examination of the BINL items relative to the SLEP instructional
objectives produced a general consensus among the test reviewers with
respect to [...]. Specifically, all of the BINL items were perceived to
be related to [...]. None of the items was perceived to be a measure of
one's ability to ask questions (objective 3) or to interact with others
(objective 4), although the testing situation may yield an indirect
indication of such abilities. Furthermore, the ability to express
feelings and attitudes (as distinct from meaning) included in objective 1
is only partially or indirectly assessed by the BINL items.

Thus it appears that all the BINL test items (starter pictures)
relate to some of the SLEP instructional objectives and are therefore
capable of providing a measure of the student's ability in the relevant
skill areas. On the other hand, not all of the SLEP objectives are
measured by the BINL items. Two of the five objectives identified by the
program staff are in fact only indirectly, if at all, assessed by the
BINL items.

B. Test-Retest Reliability

The random sample chosen for the test-retest reliability study
consisted of 192 students drawn from all grade levels (K-12) in 2[...]
schools. Numbers of elementary, intermediate and high school students
included in the sample were proportionate to SLEP students in the
district at the respective grade levels. Most of the students were
tested on January 3 through January 14, 1983 for the first test. The
retest, in most cases, was administered between January 31 and
February 17. Some of the delay between the first and second testing was
due to logistical problems in recording students' names and [...] the
schools to retest the students. All testing was conducted by the SLEP
program staff at the school sites. The completed score sheets were
machine-scored by the test publisher in California.

The data indicate that the average BINL raw score (language
complexity) for the first test was 90.[?]3 with a corresponding Normal
Curve Equivalent (NCE) score of 60.93. At retest, the students obtained
an average BINL raw score of 99.58 corresponding to an NCE of 66.44. The
gain was probably due to the time elapsed between the two testings. The
interval was, in some cases, longer than anticipated and a greater amount
of learning than expected could have occurred.
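
For readers unfamiliar with the NCE metric, the sketch below shows its
general definition: a normalized standard score with a mean of 50 and a
standard deviation of 21.06, scaled so that NCEs of 1, 50 and 99 coincide
with percentile ranks of 1, 50 and 99. The NCEs reported in this study
came from the publisher's scoring service; the functions here are only an
illustration of the scale and are not the publisher's procedure.

```python
from statistics import NormalDist

NCE_MEAN = 50.0
NCE_SD = 21.06  # places NCE 1 and 99 at percentile ranks 1 and 99


def percentile_to_nce(percentile_rank):
    """Convert a percentile rank (strictly between 0 and 100) to an NCE."""
    if not 0 < percentile_rank < 100:
        raise ValueError("percentile rank must be between 0 and 100")
    z = NormalDist().inv_cdf(percentile_rank / 100)
    return NCE_MEAN + NCE_SD * z


def nce_to_percentile(nce):
    """Convert an NCE back to the corresponding percentile rank."""
    z = (nce - NCE_MEAN) / NCE_SD
    return 100 * NormalDist().cdf(z)
```

Unlike percentile ranks, NCEs form an equal-interval scale, which is why
they can be averaged across students as in the figures above.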

The primary interest in the test-retest reliability study is of
course the intercorrelations among the variables in the study. Of
particular importance is the correlation between the first testing and
second testing. Data show that the test-retest correlation in terms of
BINL raw scores was .88. The correlation in terms of NCEs was .87.
These test-retest reliabilities should be viewed with some caveats.

First, different methods of obtaining test reliability generally
yield different results. The parallel forms correlation is typically the
lowest and the odd-even (e.g., split-half) reliability the highest
(Gulliksen, 1950, p. 215). The test-retest reliability coefficient
obtained in the present study is probably quite comparable with the
split-half coefficient of .92 in the BINL test manual (Herbert, [...]).

Second, reliabilities of oral language tests are generally lower than
most standardized tests, oral language proficiency being a
relatively more difficult trait to measure (Silverman et al., 1976;
Perlman and Rice, 1979).

Third, the time interval between the first test and the retest was
longer than anticipated at least in some cases. While the elapsed time
was not, by all indications, long enough to seriously confound the
results, it may have served to attenuate the test-retest reliability.



C. Mainstreaming 

A random sample of 236 students was used in the mainstreaming study.
These students were mainstreamed between September 1980 and June 1981.
The sample covered all grade levels (K-12) and 40 of the 55 schools in
the district. Numbers of elementary, intermediate and high school
students included in the sample were proportionate to SLEP students in
the district at the respective grade levels. Farrington High School,
which enrolled 14 percent of the SLEP students in the district, was
slightly overrepresented in the sample. Twenty-two percent of the sample
was obtained from that school.

In selecting the sample, student folders were randomly picked from
file boxes containing all exited students during the specified period.
No attempt was made to randomly select students from the various language
groups. An examination of the final sample by the SLEP program staff
indicated that the sample did appear representative of the language
groups in the district.

After the sample was selected, year-end grades for the 1981-82 school
year were obtained. These grades represented their post-SLEP achievement
in the regular school setting after the students had been mainstreamed
for at least a year. At the secondary level, grade point averages for
mathematics and language arts were obtained. At the elementary level, a
single grade point was provided [...] language arts [...], which provided
a single language arts grade.

The somewhat diverse grading schemes used at different schools were
converted to a common scale as follows:

1 = Failing

2 = Barely passing

3 = Satisfactory

4 = Very good

5 = Excellent

The exited students might have been tested with the BINL on several
occasions. In such cases, BINL scores obtained immediately prior to
mainstreaming were used. Both BINL raw scores and BINL levels were
provided.

The primary interest of the mainstreaming study was to assess the
effects of the BINL (in conjunction with the MAT) as an instrument for
exiting SLEP students. More specifically, the question of primary
interest was whether students mainstreamed on the basis of the BINL (and
MAT) were performing satisfactorily in the regular school setting.

The data suggest that on the average the exited students appeared to
be doing more than satisfactorily in the regular classroom in all subject
areas, particularly in mathematics and language arts. On a five-point
scale, the average mainstreamed student earned grades ranging from 3.19
to 3.62. When the data were further analyzed in terms of percent of
students achieving various school grades, a similarly positive
achievement pattern emerged. Data indicate that less than 4 percent of
the exited students were failing in some subjects. The
predominant pattern was satisfactory or better than
satisfactory achievement: 85.[?] percent in mathematics, [...]
percent in language arts, 93.[?] percent in reading, and [...] percent in
speaking/listening. About one-half of the students showed "very
good" or "excellent" performance in mathematics (46.5 percent) and
language arts (57.[?] percent). Over one-fifth of these students had
similarly high achievement in reading (28.4 percent) and speaking/
listening (2[?].5 percent).
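
The summary figures of this kind (mean grade, percent at or above
"satisfactory", percent at or above "very good") follow directly from the
converted five-point grades. The sketch below shows the arithmetic for one
subject; the grades in the usage note are invented for illustration and
are not the study's data.

```python
def mean_grade(grades):
    """Average grade on the five-point scale."""
    if not grades:
        raise ValueError("no grades supplied")
    return sum(grades) / len(grades)


def percent_at_or_above(grades, threshold):
    """Percent of students whose grade is at or above the threshold."""
    if not grades:
        raise ValueError("no grades supplied")
    count = sum(1 for grade in grades if grade >= threshold)
    return 100 * count / len(grades)
```

For five invented grades of 1, 3, 3, 4 and 5, the mean is 3.2, with 80
percent at or above satisfactory (3) and 40 percent at or above very good
(4).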

D. Test Use Satisfaction 

The survey on test use satisfaction was conducted to assess how
satisfied the SLEP staff were with the use of the BINL in the program.
The Inventory of Test Use Satisfaction (IOTUS) was administered to all
SLEP program staff in the district in April 1983. Similar surveys were
also conducted at the same time in the Leeward and Central districts to
provide data for comparison purposes. As indicated earlier, items in the
IOTUS were developed on the basis of pre-specified criteria for test
evaluation. The instrument consists of two parts. Part I is made up of
6 items relating to the respondent's general knowledge of and experience
with the test in question. Part II consists of 35 items mostly related
to the specific test evaluation criteria. The following is a
presentation of major findings.

Sixty-one SLEP program staff in the Honolulu district responded to
the survey. These include four educational assistants, 12 part-time
temporary teachers and 34 teachers. The others did not specify their job
positions. A predominant majority of the respondents (83.7 percent)
rated their knowledge of the BINL as good or excellent. Over 90 percent
had administered the BINL 8 or more times. A majority (83.3 percent)
reported that it took 20 minutes or less to administer the BINL.

Over one-half (57.8 percent) of the respondents indicated that at
least 50 percent of the skills taught in the project were measured by the
BINL, with a sizeable number (29.8 percent) indicating that 71 percent or
more of the skills were covered by the test. With respect to test use,
equal emphasis appeared to have been placed on evaluation (80 percent),
student selection (70 percent), diagnosis (68 percent), instructional
planning (68 percent), and student placement (90 percent).

With respect to measurement validity, the responses were highly
favorable. In all cases, a majority of the respondents felt that (a) they
knew what the test was supposed to measure (93.4 percent); (b) the items
in the test seemed conceptually sound (54.1 percent); (c) the test
measured what it was supposed to measure (78.6 percent); and (d) the test
measured something distinct from what was measured by other similar tests
(51.7 percent). The respondents seemed less certain about the ability of
the test in predicting how well a particular student would do in other
school subjects. Less than one-fourth (24.5 percent) of the respondents
felt the test possessed such predictive validity.

A predominant majority of the respondents (85 percent) indicated that
the BINL provided reliable information for its uses. Such uses
included evaluation, student selection, placement, diagnosis, and
instructional planning. Virtually all (95 percent) of the respondents
reported that the test results generally turned out the way they would
expect. Over 88 percent indicated that they generally made use of the
test information in some way.

With respect to examinee appropriateness, a majority of the
respondents (73.8 percent) felt that the layout of the test (including
print size, illustration, use of white space and color) was attractive
and helpful. The respondents, however, appeared to be having difficulty
in treating the starter pictures as "items" and most (7[...] percent) did
not respond to the question regarding how well the items were written.
Some did not respond to the question regarding item relevance or bias.
Over one-half (50.8 percent) did indicate that the items appeared free of
cultural, sexual and ethnic bias.

Virtually all (97.3 percent) respondents indicated they had no
difficulty in administering the test to students. Most (83.6 percent)
reported that they were able to administer the test in the same way each
time they tested the students, with 59 percent indicating that
administering the test was an enjoyable and rewarding experience. A
predominant majority (88.5 percent) felt that the way in which
students were required to respond to the test items was simple and
direct. Less than 10 percent felt that it took too long to administer
the test.

The BINL also received very favorable ratings in terms of
administrative usability. For instance, the test manual was rated as
clear, well-organized, consistent, thorough and helpful by 82.8 percent
of the respondents. Similarly, the instructions for administering the
test were rated as clear and easy to follow by 95 percent of the
respondents. The answer or scoring sheet was easy to use, according to
[...] those responding to the surveys.

While [...] percent of the respondents perceived the
scoring procedure as straightforward and objective, a sizeable number
(41.[?] percent) did not indicate whether it would be difficult to
hand-score the test [...] they had never [...]. [...] there would be no
difference between hand- and machine-scoring.

[...] felt the test provided an
important source of information for program improvement. A majority
(68.8 percent) reported that they often used the test results to make
instructional decisions.

A sizeable number (41.7 percent) of the respondents apparently had
never had to convert raw scores to normed or interpreted scores for the
BINL and did not respond to the question regarding score conversion.
Forty percent, however, did respond, indicating that the score conversion
process was [...]. Most respondents (70 percent) felt that it was easy to
understand the various scores provided by the test. A majority (76.7
percent) indicated that they saw no problem in using the various test
scores for purposes such as evaluation, student selection,
placement, diagnosis and instructional planning.

With regard to technical excellence, the respondents felt that the
BINL had enough items to include a sufficient range of difficulty (60
percent) and that both the raw scores and converted scores had a
sufficient range to differentiate adequately among students (78.4 and
73.4 percent, respectively). However, a substantially lower percentage
(40 percent) of the respondents believed that the BINL measured a wide
range or diversity of skills. About 10 percent indicated that the costs
of the test were too high for the kinds of information it provided.
Approximately 30 percent felt otherwise. Two-thirds of the respondents
did not respond to the item.
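
Because several survey items drew substantial non-response, it matters
whether percentages are taken over all respondents or only over those who
answered. The sketch below assumes the former, which is consistent with
the way non-response is reported as a percentage in this section; the
function name and the coding of a skipped item as None are hypothetical.

```python
from collections import Counter


def tabulate_item(responses):
    """Percent of all respondents giving each answer to one survey item.

    A value of None means the respondent skipped the item; skipped
    items stay in the denominator and are reported as "no response".
    """
    if not responses:
        raise ValueError("no respondents")
    total = len(responses)
    counts = Counter(
        "no response" if answer is None else answer for answer in responses
    )
    return {
        answer: round(100 * count / total, 1)
        for answer, count in counts.items()
    }
```

With three "yes", one "no" and one skipped response, the item would be
tabulated as 60.0 percent yes, 20.0 percent no and 20.0 percent no
response.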

The overall perceptions on the BINL were perhaps best reflected in
the respondents' expression of satisfaction with the use of the test in
the SLEP program. A predominant majority indicated they
were satisfied. A few (3.3 percent) felt otherwise; the others (18.3
percent) apparently did not have strong feelings one way or the other.
Over two-thirds (70 percent) would recommend the test for use in programs
similar to the SLEP Program.

As indicated in an earlier section, test use surveys were also conducted in the Leeward and Central districts, in which the LAS was used for student selection and other purposes. The combined sample of 81 consisted of 35 part-time teachers and 27 permanent teachers. The others did not specify their job positions.


Based on the survey results, several items appear worthy of mention for purposes of comparison. These points of interest are listed as follows:

1. The SLEP program staff in the respective districts appeared quite comparable in terms of both their knowledge of and experience with the respective tests, most having administered the BINL or the LAS 8 or more times.

2. A comparison of responses on content validity suggests that there was a better match between the BINL and the Honolulu SLEP Program. While over one-half (57.8 percent) of the Honolulu respondents indicated a match of 50 percent or better, only slightly more than one-third (35.1 percent) of the Leeward/Central respondents felt the same way about the LAS. Furthermore, proportionately more Honolulu staff used the BINL for diagnosis (68.3 percent) and instructional planning (68.3 percent) as compared with Leeward/Central staff using the LAS for similar purposes (53.0 percent and 42.5 percent, respectively).

3. With respect to measurement validity, [...] percent of the Honolulu respondents believed that the BINL measured something distinct from what was measured by other similar tests. Approximately one-third (32.4 percent) of the Leeward/Central respondents felt the same way about the LAS. In both cases, a much lower percentage of respondents (24.5 percent for Honolulu, 27.2 percent for Leeward/Central) believed that the respective tests provided results capable of predicting how well students may perform in other school subjects.

4. A substantially higher percentage of the Honolulu respondents (85 percent) indicated that their test provided reliable information for its intended use. Only 55 percent of the Leeward/Central respondents felt the same way about the LAS. Also, proportionately more Honolulu respondents (88.1 percent) reported using test information they received from the BINL [...] indicated using test information provided by the LAS.

5. There were differences in perception with respect to ease and appropriateness of test administration. Again, these differences were in favor of the BINL. For example, virtually all (98.3 percent) of the Honolulu respondents indicated that they had no difficulty in administering the BINL, while 81.5 percent of their Leeward/Central counterparts felt the same way about the LAS. Furthermore, 83.6 percent of the Honolulu respondents reported that they were able to administer the BINL in the same way each time they tested their students. The corresponding figure for the Leeward/Central respondents was 72.8 percent. Approximately 88 percent of the Honolulu respondents believed that the way in which students were required to respond to the BINL test items was simple and direct. About 76 percent of their counterparts in Leeward/Central felt the same way about the LAS.

6. Comparisons with respect to the test manual, instructions for test administration, use of answer sheets and scoring procedures were also generally in favor of the BINL. For instance, 82.8 percent of the Honolulu respondents agreed that the BINL test manual was clear, well-organized, consistent, thorough and helpful. About 68 percent of the Leeward/Central respondents felt the same way about the LAS. Practically all (95 percent) of the Honolulu respondents indicated that the instructions for administering the BINL were clear and easy to follow. About 88 percent of the Leeward/Central respondents felt the same way about the LAS. Similarly, a higher percentage of the Honolulu respondents (78.6 percent versus 65.4 percent) indicated that the BINL answer sheet was easy to use. Although a sizeable number of the respondents in both Honolulu and Leeward/Central did not respond to the item, a higher percentage of the Honolulu respondents (about 60 percent versus 35.0 percent) agreed that the scoring procedure for the BINL was straightforward and objective [...] that they would have no difficulty in hand-scoring the test.

7. In other aspects of administrative usability, [...] also generally more in favor of the BINL. A case in point: about 40 percent of the Honolulu respondents reported that it was easy to convert to normed or interpreted scores for the BINL. Approximately [...] of the Leeward/Central respondents indicated [...]. Proportionately, a far greater number of the Honolulu respondents indicated that it was easy to understand the meaning of the various scores provided by the test (70.0 percent versus 51.2 percent) and that they saw no problems in using the various test scores for the intended purposes (76.7 percent versus 49.4 percent). A higher proportion of the Honolulu respondents (68.8 percent) reported that they often used the test results to make instructional decisions than did their Leeward/Central counterparts (42.5 percent).

8. [...] respondents did not respond to the [...] items were relevant to their students [...] percent of the Leeward/Central [...] of the respondents ([...] percent for Honolulu and [...] percent for Leeward/Central) [...] that the test items were free of cultural [...] respondents ([...] percent) felt that the items were [...] of bias.

9. There was also some evidence that the BINL provided a wider range of coverage than the LAS. A predominant majority of the Honolulu respondents felt that the BINL raw scores (78.4 percent) and converted scores (73.4 percent) had a sufficient range to differentiate adequately among students. [...] ([...] and 34.7 percent, respectively) of the Leeward/Central respondents felt the same way about the raw and converted scores of the LAS. About 60 percent of the Honolulu respondents agreed that the BINL had enough items to represent a sufficient range of difficulty. Approximately 40 percent indicated that the test measured a wide range or diversity of skills. The corresponding figures for the Leeward/Central respondents were 43.7 percent and 37.7 percent, respectively. It should also be noted that a sizeable number of the respondents (28.4 percent in Honolulu and 41.6 percent in Leeward/Central) did not think the respective tests measured a wide range or diversity of skills.

10. In the Honolulu district, a predominant majority (78.4 percent) of the respondents indicated that they were satisfied with the use of the BINL in the SLEP Program. Only 3.3 percent expressed dissatisfaction. In the Leeward and Central districts, 36.2 percent of the respondents reported that they were satisfied with the use of the LAS in the [...] 36.2 percent expressed dissatisfaction. About 70 percent of the Honolulu respondents would recommend the use of the BINL in programs similar to the SLEP Program. Approximately 34 percent of the Leeward/Central respondents would recommend the use of the LAS.
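The percentage comparisons in the list above can be restated as gaps in percentage points. The following is a minimal sketch in Python and is not part of the original study: the figures are those printed in items 5 and 10, while the names `comparisons` and `gap_in_points` are illustrative assumptions.

```python
# Percentages as printed in the comparison list:
# (Honolulu respondents on the BINL, Leeward/Central respondents on the LAS)
comparisons = {
    "no difficulty administering the test": (98.3, 81.5),
    "administered the same way each time": (83.6, 72.8),
    "satisfied with use of the test": (78.4, 36.2),
    "would recommend the test": (70.0, 34.0),
}


def gap_in_points(binl_pct, las_pct):
    """Return the BINL-minus-LAS difference in percentage points."""
    return round(binl_pct - las_pct, 1)


# Hypothetical helper table: one gap per survey item.
gaps = {item: gap_in_points(b, l) for item, (b, l) in comparisons.items()}

# List the items from the largest gap to the smallest.
for item, gap in sorted(gaps.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{item}: {gap:+.1f} points")
```

A gap in percentage points says nothing about statistical significance; the sample sizes per item would be needed for that, and the report does not give them here.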

While it is possible to overinterpret perceptual data, the responses obtained from the three surveys did present a clearly discernible trend supporting the continued use of the BINL in the Honolulu district. Not only did the respondents think highly of the test, comparative data suggest that their support and enthusiasm appeared to be greater than that expressed by users of the LAS for that test in the Leeward and Central districts. In most cases, survey responses were more favorable to the BINL than they were to the LAS.

CONCLUSIONS

The primary purpose of the present study was to obtain and interpret data pertaining to the BINL to help determine whether the continued use of the test in the Honolulu district is warranted. To that end several studies were conducted to obtain information on the psychometric qualities (particularly with respect to the content validity and test-retest reliability) of the BINL, the effects of the BINL as an instrument for mainstreaming SLEP students, and perceptions of test users with respect to their overall test use satisfaction.

Several approaches were taken to obtain the relevant data, including the formation of a test review committee, the testing and retesting of a random sample of SLEP students, an in-depth examination of [...] of a random sample of exited students, and the conduct of surveys on test use satisfaction in three school districts. Results of the [...] to support the following conclusions:

1. Findings obtained from the test review sessions [...] the BINL [...] as a measure of oral language proficiency. [...] match between [...] in the test and the psycholinguistic principles which form the philosophical bases of the SLEP Program is considered excellent, and perhaps unique [...]. It is also obvious, however, that the BINL items do not measure all the skills [...] in the SLEP [...]. The BINL's content coverage [...] to oral proficiency in English and does not [...] as reading comprehension.

2. Results of the test-retest [...] language proficiency test possesses an adequately high degree of reliability. In spite of the attenuating factors which inadvertently occurred in the study, a [...] coefficient of around .88 was obtained for the test. It is also noted, however, that responses obtained from the test use survey suggest that the scoring procedures were perhaps not as straightforward and objective as they could have been.
3. The mainstreaming study showed that in most cases exited students were performing satisfactorily, if not better than satisfactorily [...]. [...] school grades were converted to a common five-point scale, [...] average performance was indicated in all subject areas included in the study. Only in very few cases (less than 4 percent) were exited students shown to be failing in some subjects. A sizeable number of the exited students (20-57 percent) were doing "very good" or "excellent" work in the regular classroom following their exit from the SLEP Program.

4. The test use surveys conducted in the Honolulu, Leeward and Central districts indicated that there was a high degree of support and enthusiasm on the part of the program staff for the use of the respective tests (the BINL and the LAS) in these districts. It also appeared that the degree of support and enthusiasm was greater in Honolulu than in the other two districts. There was clear evidence that the Honolulu program staff were highly satisfied with the use of the BINL in the SLEP Program and believed the test served all the functions it was intended to serve.
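The report gives a test-retest coefficient of around .88 but does not say how it was computed. A Pearson product-moment correlation between the first and second administrations is one common way to estimate such a coefficient. The sketch below assumes that choice; the function name `pearson_r` and the scores are hypothetical and are not data from the study.

```python
from math import sqrt


def pearson_r(first, second):
    """Pearson correlation between paired scores from two administrations."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two equal-length lists with at least 2 scores")
    n = len(first)
    mean_x = sum(first) / n
    mean_y = sum(second) / n
    # Sum of cross-products and sums of squares of the deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(first, second))
    var_x = sum((x - mean_x) ** 2 for x in first)
    var_y = sum((y - mean_y) ** 2 for y in second)
    if var_x == 0 or var_y == 0:
        raise ValueError("scores must vary for the correlation to be defined")
    return cov / sqrt(var_x * var_y)


# Hypothetical scores for six students tested twice (illustration only).
test_scores = [42, 55, 61, 70, 48, 66]
retest_scores = [45, 53, 64, 68, 50, 69]

r = pearson_r(test_scores, retest_scores)
print(f"test-retest correlation: {r:.2f}")
```

A coefficient near 1 means students kept roughly the same rank order across the two sessions; factors such as a long interval between sessions or different examiners tend to lower it.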
DIRECTIONS FOR COMPLETING THE INVENTORY OF TEST USE SATISFACTION



The purpose of the Inventory is to assess how satisfied you are with the use of tests in your program. A separate inventory should be completed for each test. For this particular study, Honolulu District respondents should complete the Inventory for BINL. Leeward and Central District respondents should complete the Inventory for LAS. Keep the following directions in mind when you respond to the Inventory:

1. Fill in your name (optional), position, school district and date on page 1 of the Inventory.

2. Indicate the name of the test for which you are completing the Inventory. Honolulu District respondents should complete the Inventory for BINL. Leeward and Central District respondents should complete the Inventory for LAS.

3. Be as thorough and candid as you can in responding to the items. Responses to the Inventory will not be identified with names of individual respondents.

4. Read the items carefully before you respond. Throughout the Inventory the term "program" means the SLEP Program. The term "students" means students participating in the SLEP Program or students being tested [...] in the Program. Unless otherwise indicated or implied, the test means the test for which you are completing the Inventory.

5. Use your general test [...] as a guide in responding to the items. We want your best professional judgments on the test, not scientific facts.

6. Return the completed Inventory to Dr. Don Enoli of the Honolulu District office.



INVENTORY OF TEST USE SATISFACTION

Name (optional):
Position:
School District:
Date:

Name of test (one only) for which you are completing this inventory:

Check one of the choices for each of the following items.

1. How would you rate your knowledge of the test?

    ___ Little
    ___ Moderate
    ___ Good
    ___ Excellent



2. How many times have you administered the test to students in the program?

    ___ Never
    ___ 1-3 times
    ___ 4-7 times
    ___ 8 or more times



3. How long does it take to administer the test to the average student?

    ___ 1-10 minutes
    ___ 11-20 minutes
    ___ 21-30 minutes
    ___ 31-40 minutes
    ___ more than 40 minutes



4. What percentage of the [...] that you teach students in the program is covered by the test?

    ___ 20% or less
    ___ 21-30%
    ___ 31-40%
    [...]
    ___ 71% or more



5. The test is primarily used for (check all that apply):

    ___ Evaluation
    ___ Student Selection
    ___ Diagnosis
    ___ Instructional Planning

6. How long has the test been in use in the program for the purpose(s) indicated above?

    ___ 1-2 years
    ___ 3-4 years
    ___ More than 4 years



PART II

Indicate whether you agree or disagree with each of the following statements by circling SA for Strongly Agree; A for Agree; N for Neutral; D for Disagree; and SD for Strongly Disagree. Circle NA only if the item is not applicable or inappropriate.

1. I know what the test is supposed to measure.

    SA   A   N   D   SD   NA

2. The items in the test seem conceptually [...] based on theory of linguistics, education, psychology and [...].

    SA   A   N   D   SD   NA

3. The test measures what it is supposed to measure.

    SA   A   N   D   SD   NA

4. The test measures something distinct from what is measured by other similar tests.

    SA   A   N   D   SD   NA

5. The test provides results that tell me how well a particular student is doing in other school subjects too.

    SA   A   N   D   SD   NA

6. The test provides reliable information for its intended use (e.g., [...] placement, diagnosis and instructional planning).

    SA   A   N   D   SD   NA



7. The test results generally turn out to be what I would expect.

    SA   A   N   D   SD   NA

8. I generally make use of information I get from the test.

    SA   A   N   D   SD   NA

9. I have no difficulty in administering the test to students.

    SA   A   N   D   SD   NA

10. I am able to administer the test in the same way each time I test the students.

    SA   A   N   D   SD   NA

11. The way in which students are required to respond to the test items is simple and direct.

    SA   A   N   D   SD   NA

12. Administering the test is an enjoyable and rewarding experience.

    SA   A   N   D   SD   NA



13. It takes too long to administer the test.

    SA   A   N   D   SD   NA

14. The test manual is clear, well-organized, consistent, thorough and helpful.

    SA   A   N   D   SD   NA

15. Instructions for administering the test are clear and easy to follow.

    SA   A   N   D   SD   NA

16. The answer or scoring sheet is easy to use.

    SA   A   N   D   SD   NA

17. The scoring procedure for the test is straightforward and objective.

    SA   A   N   D   SD   NA

18. I would have no difficulty in hand-scoring the test.

    SA   A   N   D   SD   NA




19. When the test is machine-scored, the results are often somewhat different from what I would expect.

    SA   A   N   D   SD   NA

20. The test provides an important source of information for program improvement.

    SA   A   N   D   SD   NA

21. I often use the test results to make instructional decisions.

    SA   A   N   D   SD   NA

22. The layout of the test (including print size, illustrations, use of white space and color) is attractive and helpful.

    SA   A   N   D   SD   NA

23. The test items are generally well written.

    SA   A   N   D   SD   NA

24. The test items are relevant to students.

    SA   A   N   D   SD   NA

25. The test items are free of cultural, sexual and ethnic bias.

    SA   A   N   D   SD   NA



26. It is easy to convert raw scores to normed or interpreted scores for the test.

    SA   A   N   D   SD   NA

27. It is easy to understand the meaning of the various scores provided by the test.

    SA   A   N   D   SD   NA

28. I see no problems in using the various test scores for the intended purpose (e.g., evaluation, student selection, placement, diagnosis, and instructional planning).

    SA   A   N   D   SD   NA



29. The test has enough items to include a sufficient range of difficulty.

    SA   A   N   D   SD   NA

30. The test measures a wide range or diversity of skills.

    SA   A   N   D   SD   NA

31. The raw scores have a sufficient range to differentiate adequately among students.

    SA   A   N   D   SD   NA

32. The converted scores have a sufficient range to differentiate adequately among students.

    SA   A   N   D   SD   NA

33. The costs of the test (including test materials, administration, scoring and interpretation) are too high for the kinds of information it provides.

    SA   A   N   D   SD   NA

34. I am very satisfied with the use of the test in my program.

    SA   A   N   D   SD   NA

35. I would recommend the test for use in programs similar to mine.

    SA   A   N   D   SD   NA



